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EXTRA CTING SOFT INFORMATION IN 
A BLOCK-COHERENT COMMUNICATION SYSTEM 

Related Applications 

5 

The present invention claims the benefit of U.S. Provisional Patent Application 
S.N. 60/459,677, filed April 2, 2003 titled "METHODS AND APPARATUS FOR DATA 
TRANSMISSION IN A BLOCK-COHERENT COMMUNICATION SYSTEM" and pending 
U.S. Patent Application S.N. 10/637,844, filed August 8, 2003 titled "METHODS AND 
1 0 APPARATUS FOR DATA TRANSMISSION IN A BLOCK-COHERENT 
COMMUNICATION SYSTEM". 

Field Of The Invention 

1 5 The present invention is directed to methods and apparatus for data 

communication over a block-coherent channel and, more particularly, to methods and apparatus 
for joint demodulation and decoding for the purpose of forward error detection and/or 
correction. 

2 0 Background 

A communication system requires noncoherent detection when it is infeasible for 
the receiver to maintain a reliable estimate for instantaneous channel gain (magnitude and, 
especially, phase). Noncoherent communication systems include, for instance, a wireless 

2 5 multiple access system where the mobile nodes have limited power and cannot afford to transmit 

high power known symbols, e.g., pilots, to enable reliable channel estimation. A noncoherent 
communication channel may possess some coherence properties. For example, a noncoherent 
communication channel may include coherent blocks, where a coherent block is a time interval 
during which the channel variations are negligibly small. Communication over such a channel is 

3 0 referred to as block- coherent communication. 

Block-coherent communication may arise naturally in fast frequency-hopping 
orthogonal frequency division multiple (OFDM) access systems. In such systems information 
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may be modulated onto a subset of available frequencies, called tones, in every symbol time. To 
enhance spectral efficiency and increase diversity gain, tones utilized are, in some cases, rapidly 
hopped across the entire utilized frequency band in every L symbols, e.g., L consecutive 
symbols are mapped to one tone, followed by another L symbols mapped to a different tone, and 
5 so on. When L is small, it is possible to assume the consecutive L symbols experience identical 
channel gain. Although the amplitudes of the gains of two consecutive sets of L symbols can be 
close, their phases are normally completely independent. 

More precisely, a block-coherent communication system can be defined as 
1 0 follows: for a system represented in discrete time domain, the channel gain is an unknown 

complex random variable that generally remains the same for every L consecutive symbols but 
otherwise varies independently according to some distribution, e.g., the phase is uniformly 
distributed over [0, 2PI] and the magnitude is Rayleigh distributed. 

15 For block-coherent communication the nominal modulation scheme is differential 

M-array phase-shift-keying (DMPSK). DMPSK carries the information in the phase differences 
between two successive symbols over the coherent block. For illustration, to transmit N x (L-l) 
MPSK information symbols s(i), each of the N consecutive sets of L-l symbols, denoted as s(l), 
s(2), s(L-l), is differentially encoded to transmitted symbols t(0), t(l), t(2), t(L-l), where 

2 0 t(0) is set to a known symbol, and t(j) = t(j-l) x s(j) for j=l, N-l. 

Modulation schemes other than DMPSK are possible. For instance, with the 
insertion of known symbols in a block, information symbols may be transmitted directly on 
other symbols instead of differentially. This modulation scheme may be referred to as pseudo- 
2 5 pilot modulation. It is apparent, however, that at most L-l information symbols can be 

transmitted inside a dwell of length L due to the phase uncertainty. Hence we may assume a 
known symbol is present in each dwell. Using the notation in the above illustration, t(0) is set to 
a known symbol, and the rest transmitted symbols are t(j) = s(j) for j=l, L-l. 
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With forward error-correction coding, a block-coherent communication system 
will normally include an encoder (which inserts structured redundancy into original data 
stream), a modulator, e.g. DMPSK, (which maps binary data bits to MPSK symbols), a 
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demodulator (which extracts out soft information and feeds it to the decoder), and a decoder 
(which decodes the original message based on soft information from the demodulator). 



With block-coherent reception, the received symbol y(i) and the transmitted 
5 symbol t(i) are related as follows: 



y(i) = ae 1 t(i) + n(i) , 



where □ is the unknown phase, □ is the unknown (real) channel gain, and n(i) is the additive 
1 0 noise component. 

In most coded systems, a receiver applying iterative demodulation and decoding - 
a scheme henceforth referred to as turbo equalization - has significant performance gain over a 
non-iterative receiver. For instance, convolution and/or turbo coded DMPSK systems, 
15 investigated by Shamai et aL in "Iterative decoding for coded noncoherent MPSK 
communications over phase-noisy AWGN channel" published in IEE Proceedings 
Communication 2000, demonstrates turbo-equalization performance within 1.3 dB of channel 
capacity and 1 dB better than traditional schemes. 



2 0 It has been shown that for turbo equalization to be maximally effective, the code 

design should take the effect of iterative demodulation into account. The importance of code 
design and an effective way of achieving it are described in Jin and Richardson's paper "Design 
of Low-Density Parity-Check Codes in Noncoherent Communication," published in 
International symposium on information theory 2002. The approach therein improves the 

2 5 performance to within 0.7 dB of channel capacity. 



While the performance of turbo equalization is important, for a communication 
system to be practical for use in a wide range of devices, e.g., consumer devices, it is important 
that the turbo equalization be capable of being implemented at reasonable cost. Accordingly, the 
3 0 ability to efficiently implement turbo equalization schemes used for a block-coherent 

communication system, e.g., in terms of hardware costs, can be an important consideration. 
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The practical challenges posed by turbo equalization, in the light of 
implementation cost, are (i) the complexity of soft-in soft-out (SISO) demodulator and (ii) the 
data interleaving necessary at the transmitter and the receiver. 

5 One known method of implementing a SISO demodulator is to apply belief 

propagation, e.g. Bahl-Cocke-Jelinek-Raviv (BCJR) algorithm if DMPSK modulation is used. 
Such a demodulator requires considerable complexity. BCJR algorithm runs on a trellis structure 
resulted from quantizing the phase space [0, 2PI] into equally spaced phase points. For instance, 
a 8-level quantization forms 8 points, 0, 1/4 PI, 7/4 PI. Therefore, the unknown phase 
10 associated to a dwell can only be one of those points, so are the phases of the received symbols, 
given no additive noise. L symbols inside a dwell, each being one of the eight states, comprises 
the trellis structure. The information symbol determines the transition from the current state to 
the next state. On this trellis, BCJR algorithm returns a soft guess on the information symbol. 
The complexity of BCJR algorithm is linear in the cardinality of the state space. 

15 

Implementing belief propagation demodulation for pseudo-pilot modulation 
entails similar complexity - linear in the cardinality of the quantization space. 

In view of the above, it is apparent that there is a need for methods and apparatus 
2 0 which address the complexity of the soft-in soft-out demodulation. There is a need in block 

coherent communications systems for low complexity demodulation methods and apparatus that 
achieve good performance. 

Brief Description of the Figures : 

25 

Figure 1 illustrates an exemplary communication system including an encoder, a 
first interleaver, a modulator, a communications channel, a demodulator, a deinterleaver, a 
second interleaver, and a decoder in which the methods and apparatus of the present invention 
are employed. 
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Figure 2 illustrates an exemplary structure of joint demodulation and decoding 
with data interleaving and de-interleaving in an exemplary receiver in accordance with the 
present invention. 
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Figure 3 illustrates an exemplary soft-in soft-out demodulator implemented in 
accordance with the present invention which may be used as the demodulator in Fig. 2. 

5 Figure 4 is an illustration of an exemplary embodiment of a module implemented 

in accordance with the invention which produces updated independent phase estimates from 
independent phase estimates using the known extrinsic rule. 

Figure 5 is a flowchart illustrating an exemplary method in accordance with the 
1 0 present invention. 

Figure 6 is a drawing illustrating exemplary sets of information in accordance 
with the present invention. 

15 Summary of the Invention : 

The present invention is directed to methods and apparatus for data 
communication over a block-coherent channel. For simplicity, the symbols transmitted in one 
coherent interval are referred to as a dwell, in this application. The length of a dwell is L. A 
2 0 dwell, comprising L symbols, serves as a unit for information transmission purposes. 

Various examples used to explain the invention involve the use of Gray mapped 
QPSK symbols, with constellation points SO = 1+j, SI = S2 =-l-j, and S3 = 1-j on the 
complex plane. A pair of two bits, a.k.a. a dibit, maps to one QPSK symbol: a dibit 00 maps to 

2 5 SO; a dibit 01 maps to SI; a dibit 11 maps to S2; and a dibit 10 maps to S3. Symbolically, we 

use S(clc2) to denote the QPSK symbol mapping for a dibit (cl c2), then S(00)=S0, etc. 

It can be appreciated the techniques described in the present application in the 
exemplary context of QPSK, can be easily generalized and are applicable to high order MPSK 

3 0 with such implementations being deemed within the scope of the present invention. 

Various features of the present invention are directed to methods and apparatus 
for a low-complexity SISO demodulator suitable for processing pseudo-pilot modulated 
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information corresponding to each of one or more dwells. The low-complexity method of the 
invention provides good performance under turbo equalization. 



In summary, this is accomplished by first independently estimating the unknown 
5 phase based on each of L pairs of information, each pair including a received symbol y(i) and 
updated a priori information on a transmitted symbol t(i) feed from the decoder. That produces 
L independent estimations, one for each symbol index i. Secondly, a new phase estimation for 
an i th symbol may be formed by averaging of the estimation from the other L-l symbols, 
following the extrinsic principle which is described, in C. Berrou, A. Glavieux, P. 
10 Thitimajshima, " Near Shannon limit error-correcting coding and decoding: Turbo codes " 

Proceedings 1993 IEEE International Conference on Communications, Geneva, Switzerland, 
May 1993, pp.1064 -1070. 

Finally, the new phase estimation can be used to generate soft information about 
15 the symbol (and thus the two bits associated with it). This technique does not require quantizing 
the unknown phase and thus entails little complexity. 

In an exemplary SISO demodulation embodiment, a decoder feeds the 
demodulator soft messages, m(2i) and m(2i+l), for bits c(2i), c(2i+l) associated with QPSK 
2 0 symbol t(i). Soft messages can be in the form of log-likelihood ratio, meaning the logarithm of 
the ratio that the probability that bit c is a 0 over the probability that bit c is a 1, i.e., m(i) = 
log(p(c(i)=0)/p(c(i)=l). Log-likelihood ratios of both bits c(2i) and c(2i+l) can be manipulated 
to obtain the expected value of symbol t(i): 

2 5 E(t(i)) = (tanh(m(2i)/2), tanh(m(2i+l)/2)), 

and the expected value of the conjugate of s(i): 

E(t(i)*) = (tanh(m(2i)/2), -tanh(m(2i+l)/2)). 
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Due to the distinctive property of tangent-hyperbolic function, simplification of 
this expected value by approximation is possible with little performance degradation. For 
example, assume soft-messages are quantized in ln(2) steps, an exemplary approximation is: 
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-3/4 when m<=-10 
-1/2 when -10<m<=-6 
-1/4 when -6<m<-l 
5 tanh(m/2)= 0 when -l<=m <=1 Eq. (1) 

1/4 when l<m<6 

1/2 when 6<=m<10 
3/4 when m>=10. 

10 Under this approximation three bits suffice to represent both the real and complex component of 
the expected value of symbol t(i). 

A known symbol t(0), which can be assumed for purposes of explaining the 
invention in an exemplary embodiment to be 1+j without loss of generality ,_inside the dwell 
15 deserves somewhat special treatment. Its expected value E(t(0)) is known and thus set to be 

t(0)=l+j, and E(t(0)*) is set to be t(0)* = 1-j. This is equivalent to setting tanh(m/2)=l. In other 
embodiments, the known symbol t(0) may be a different known value, and E(t(0)) and E(t(0)*) 

may be set accordingly. Given y(i) = (Xe* e t(l) + n(j) , multiplying y(i) with E(t(i)*) generates 

an estimation of a e , referred to as T(i), for every symbol inside the same dwell. T(0) is an 

2 0 estimation of a e J for the known symbol inside the dwell. 

Therefore inside one dwell, we can obtain L independent estimates T(i) which 

include independent estimates of the unknown phase e . These independent estimates can 
now form a new set of L estimates which include new estimates of the unknown phase, T\i) , 

2 5 following the extrinsic principle, 

L-l,k*i 

T\i) = ( £r(*) +A7X0)). Eq.(2) 

Subsequently, multiplying y(i) with the conjugate of T\i) gives an approximation of 

3 0 a 2 t(i)+an(i); therefore, the soft messages of m(2i) and m(2i+l) will be proportional to the real 
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and imaginary component of y(i) T'(i) . Proper scaling, by shifting bits with truncation and/or 
saturation, converts them into the right form of log-likelihood ratio. 

In principle, the weight coefficient "a" should be tuned adaptively during turbo 
5 equalization for optimal performance. In practice, however, an exemplary constant setting a=2 
provides, in some implementations, near optimal performance. As a constant multiplication can 
be incorporated into the mapping step, when implementing Eq. (1) in software or hardware, this 
further simplifies implementation. 

10 Detailed Description of the Invention ; 

Fig. 1 illustrates an exemplary general-purpose communication system 10 which 
implements the present invention. In some embodiments, communications system 10 may be, 
e.g., an OFDM communications system. The system 10 includes a transmitter 100 which is 

15 coupled by a communication channel 1 10 to a receiver 120. In other embodiments, the 

transmitter 100 may be part of a wireless terminal, the receiver may be part of a base station, and 
the communications channel may be an uplink communications channel. In some embodiments, 
the transmitter 100 may be part of a base station, the receiver may be part of a wireless terminal, 
and the communications channel may be an downlink communications channel. The transmitter 

2 0 100 comprises a data encoding circuit, e.g., an encoder 101, an interleaver 102, and a modulator 
103. The receiver comprises a demodulator 121, a deinterleaver 122, an interleaver 123, a data 
decoder 124, and a sync signal generator 151. Decoder 124 may be a Turbo decoder, an LDPC 
decoder, or another type of decoder. The encoder 101 maps an input binary data stream A to a 
structured binary data stream Xi with redundancy. The interleaver 102 interleaves Xi to produce 

2 5 another data stream X2. The modulator 103 transforms the binary stream X 2 to physical signals S 

feasible for practical transmission, e.g. QPSK signals. The communications channel 110 may 
be, e.g., an air link. After modulation signals t are transmitted through the channel 1 10 to the 
receiver 120 and received as signals Y. The demodulator 121, deinterleaver 122, and interleaver 
123 form a combined demodulation and interleaver device 150. The demodulator 121 and data 

3 0 decoder 124 supply symbol timing and other information to sync signal generator 151. The sync 

signal generator 151 generates from the supplied information a dwell sync signal which 
indicates the alignment of received symbols within a dwell. This information allows a device 
supplied with the dwell sync signal to determine which symbol within the dwell is being 
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processed at any particular point in time. At the receiver side, demodulator 121 extracts 
information X 2 ' from the noisy distorted reception Y. The deinterleaver 122 reorders the soft 
messages X 2 ' to Xi' corresponding to the original ordering of the code structure. And the 
decoder 123 tries to recover the original binary data stream A through the use of redundancy 
5 present in the coded data stream X 2 ' produced by demodulation. The output from data decoder 
124 is A' representing the receiver's estimated recovery of original data stream A. The data path 
from data decoder 124 to demodulation 121 represents a feedback loop. The feedback path 
includes interleaver 123. The feedback messages corresponding to Ui are interleaved by an 
interleaver 123 to transform the ordering at the decoder to the ordering at the output of 
10 modulator 103 to producing messages corresponding to U 2 . 

Figure 2 is a drawing of an exemplary combined demodulation and interleaver 
device 150' that may be used in place of device 150 of Figure 1. Exemplary demodulation and 
interleaver device 150' illustrates an exemplary implementation of joint decoding and 

15 demodulation at the receiver side, coupled with a data interleaving and de-interleaving control 
circuit 202, in accordance with the present invention. Feedback messages from the decoder, 
corresponding to Ui, are stored in soft-output (from decoder) memory (module 200). Soft- 
outputs are generated from received signals Y and stored in soft input (from channel) memory 
(module 208). The soft-outputs from the decoder memory(module 200) and soft-inputs from the 

2 0 channel receiver memory(module 208) are assumed to have the same configuration as the coded 
bits. Interleaving control circuit 202 generates the correct address to access both soft-outputs 
from the decoder memory and soft-inputs from the channel memory in the ordering of dwells, 
resulting in the forwarding of messages corresponding to U 2 and Y to demodulator 204. 
Demodulator 204 extracts information X 2 ' from the noisy distorted reception Y. The same 

2 5 generated addresses, generated by the interleaving control circuit 202 for the access of 

information being input to the demodulator 204, are subjected to a delay introduced by a delay 
line 210. The output of delay line 210 serves as write addresses used to control the writing of 
the soft inputs of information X 2 ', which are outputs of demodulator 204,into the decoder 
memory (module 206), and supplied to the decoder as Xi'. 

30 

Write and read addresses supplied to memory 202, 206 may cause soft values 
corresponding to known signals, e.g., pilot signals or pseudo pilot signals, to be written to a 
dummy location. These values are not important since known soft values corresponding to 
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pilots or pseudo pilots can be used in their place. However, it is important to use actual 
addresses for information corresponding to data symbols since such values are not fixed and are 
important to data recovery. Writing pilot or pseudo pilot related values to dummy addresses is 
done in some embodiments to provide an easy way of maintaining timing synchronization with 
5 the stream of received symbols to be processed. 

Figure 3 illustrates an exemplary demodulator 204 which performs demodulation 
in accordance with one embodiment of the invention. The demodulator 204 can be used as the 
exemplary demodulator in Figure 2. In the illustration of Figure 3, subscript x for an exemplary 
10 complex symbol, e.g., representing U2, X2, Y, IPE, or UBPE, indicates the real component of the 
complex symbol, and subscript y indicates the imaginary component of the complex symbol. 
Thus, in the Fig. 3 example, Uz* and U2 y represent the I and Q parts of U2, Y x and Y y represent 
the I and Q parts of Y and X' 2x and X' 2y represent the I and Y parts of X2. 

15 In the demodulator 300, mapping module 301 maps soft messages m(2i) and 

m(2i+l) from the decoder, associated respectively to the real and imaginary components of 
symbol U(i), each to a three-bit value according to Eq.(l). Sub-module 311 performs the 
mapping of soft messages m(2i) associated with U x to a three-bit value, while sub-module 313 
maps soft-messages m(2i+l) associated with U y to produce another three bit value. Mapping 

2 0 module 301 also includes a weighting coefficients "a" 319, which may be utilized, e.g., as a 
multiplier, by the mapping sub-modules (311, 313) with known symbols, e.g., a pseudo pilot 
symbol. In some embodiments, the weighting coefficient, "a" (319) may be set to a constant 
value, e.g., 2. In other embodiments, the weighting coefficient "a" may be adaptively tuned, 
e.g., during turbo equalization. This mapping of module 301 forms the expected value of real 

2 5 and imaginary component of t(i)* (conjugate of t(i)), and thus the expected value of t(i)*. The 

outputs from mapping module 301 are soft bits corresponding to complex symbol values. In 
some embodiments, a Look-Up-Table may be used for this mapping. Based upon dwell sync 
signal 323, the mapping modules 311, 313 can determine when a known symbol, e.g. a pseudo 
pilot symbol or pilot symbol, is being processed. The processing of the known symbol, e.g., 

3 0 pseudo pilot symbol or pilot symbol, may involve the use of a known value in place of the soft 

value supplied to the soft value input of mapping module 301. Thus mapping may be performed 
differently for known symbols, e.g., pseudo pilot or pilot symbols, than for data symbols. 
Similarly, based upon the dwell sync signal 323, complex multiplier A 303 may process values 
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corresponding to known symbols, e.g., pseudo pilot symbols or pilot symbols, differently than 
symbols corresponding to data symbols. For example, first complex multiplier 303 may limit the 
multiplication in the case of a known symbol, e.g., a pseudo pilot symbol or pilot symbol, value 
to a real multiplication operation to avoid a rotation in phase which can occur in the case of 
complex multiplications performed on values corresponding to a data symbol in a dwell. The 
Complex multiplier A 303 produces the product of the expected value of t(i)* and received value 
y(i). The resulting product is the independent phase estimation (IPE) for each symbol in a 
dwell. Separate real (BPE X ) and imaginary (LPE y ) IPEs are generated. Module 305 processes the 
components of the complex IPE to generate an updated independent phase estimation (UIPE) 
which includes real and imaginary components for every symbol inside a dwell. Module 305 
uses known symbol information, e.g., pilot or pseudo pilot symbol information, and knowledge 
of the dwell structure in the processing, e.g., the UIPE module 305 may determine from the 
dwell structure where a pilot or pseudo pilot will occur in a sequence of symbols. Dwell sync 
signal 323 is directed as an input to module 305. This processing to generate the UIPE can be 
implemented independently for each of the real and imaginary components. In Figure 3, 
extrinsic estimate/update sub-module 315 processes IPE X to generate UIPEy, while extrinsic 
estimate/update sub-module 317 processes IPE y to generate UIPEy. 

The generated UIPE X and UIPEy are input to a conjugate circuit 306. Conjugate 
2 0 circuit 306 performs a conjugation operation on the signals supplied thereto. In various 

implementations, conjugate circuit 306 and complex multiplier B 307 are combined into a single 
circuit. 

A complex multiplier B 307 then multiplies the UIPE* with the received value 

2 5 y(i) to generate the new soft messages m'(2i) and m'(2i+l), up to a scaling constant. The 

received value y(i) might arrive from the same source as the input to module 303, delayed by a 
delay line module 309 that matches the pipeline delay in module 305. The output from complex 
multiplier B is X' 2 (i). 

3 0 In one particular exemplary embodiment, module 305 produces the complex 

UIPE following the known extrinsic rule. Figure 4 illustrates such an exemplary 
implementation of a module 400 implemented using the extrinsic rule, which can be used as the 
UIPE module 305 of Figure 3. In such an embodiment, an accumulator 401 generates the 
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summation of all the IPE T(i) in a dwell. The accumulator 401 includes a summing module 403, 
a delay element 405 (with a delay of 1 time interval corresponding to one symbol being 
processed), and a feedback loop. The output of the summing module 403 is input to the delay 
element 405; the output of the delay element 405 is fed back as an input to the summing module 
5 403 to be summed with the EPE input signal. The output of the delay element 405 is also 

coupled to the input of a sum register 407. The summation of all IPE inside a dwell is locked, 
e.g., stored, in the sum register 407, where the lock is controlled by a signal dwell lock 409, 
indicating the boundary of dwells. In this exemplary embodiment, it is assumed that there are L 
symbols in a dwell. Reset signal 411 can be used to clear the accumulator 401 and restart 

10 summing for a subsequent dwell. Sum register 407 retains the locked sum value corresponding 
to the last dwell, while accumulator 401 starts accumulation of a sum corresponding to a new 
dwell. Module 400 also includes a control module 415 which can generate the reset signal 411 
and the dwell lock signal 409. Dwell sync signal 323 is an input to the control module 415 and 
may be used as a trigger by control module to generate signals 411 and 409 at the appropriate 

15 times. Dwell sync signal 323 is generated from sync signal generator 151, and is used to provide 
synchronization capability which facilitates identifying dwell boundaries. 

Module 400 also includes a delay element 413 (with a delay of L time intervals 
corresponding to L symbols of a dwell being processed) and a subtraction module 415. Delay 
2 0 element 413 performs a delay of L on each IPE symbol received as input. Z. L 413 is a pipeline 
delay element capable of storing multiple IPE values. The delay 413 can be implemented as a 
series of L unit delay elements. Other implementations are also possible. Subtraction module 
415 receives as input the outputs of delay element 413 and the dwell symbol sum value locked 
in sum register 407. The output of subtraction module 415 is the UEPE. Subtracting the EPE of 

2 5 the ith symbol from the sum (of the dwell set including the ith symbol) produces the UEPE of the 

ith symbol, where the IPE is readily available through the use of a delay line. It should be 
appreciated that the IPE and the UIPE are complex values and that the processing shown in Fig. 
4 is performed on each of the real and imaginary components of the IPE in parallel. 
Accordingly, each of the illustrated elements may be viewed as performing the individual 

3 0 processing operation on a complex value. 

Figure 5 is a flowchart 500 illustrating an exemplary method of operating a 
receiver, implemented in accordance with the invention, to perform demodulation operations in 
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accordance with the present invention. The method starts in step 502, where the receiver is 
powered on and initialized. Operation proceeds to step 504, where the receiver is operated to 
receive a first set of complex values, said first set of complex values being received complex 
symbol values of signals communicated over a channel, e.g., a wireless communications uplink 
5 channel. In some embodiments, the first set of complex symbol values are produced by an 
OFDM modulated communications system. The communication may be a block-coherent 
communication system. In some embodiments, each set of complex values may correspond to 
seven received symbols. Other size blocks are possible. In some embodiments one of the 
received signals may be a known symbol, e.g., a pseudo pilot symbol or a pilot symbol. In step 

1 0 506, the receiver is operated to store each of the first set of complex symbol values for a second 
predetermined time, said second predetermined time being longer than a first predetermined 
time. The second predetermined time is chosen to be long enough so that the first set of values 
will be available when generating the fourth set of values. The values may be stored in a delay 
line. Operation proceeds to step 508. In step 508, a set of soft bits, e.g., generated from a 

1 5 decoder is received. The received soft bits correspond to said complex symbol values. In some 
embodiments, there are at least two soft bits per received complex symbol value. In other 
embodiments, there are at least three soft bits per received complex symbol value. 

In step 510, a multiplication module performs a complex multiplication operation 
2 0 on each of at least some of said first set of complex values to generate at least some elements of 
a second set of complex values, the multiplication operation performed on each individual one 
of said set of at least some of said first set of complex values including multiplying said 
individual complex values by a complex value determined from at least some of said soft bits. 
Note that the multiplication may not be performed on the known symbol, e.g., pilot or pseudo 

2 5 pilot symbol, with a predetermined value being output instead. In some embodiments, each 

complex multiplication operation is performed by performing two shift operations and an 
addition operation. In some embodiments, one of the received complex symbol values is not 
rotated, e.g., the phase is unchanged by the multiplication operation. In some embodiments, the 
complex symbol value that is not rotated occurs in a pre-selected location within the set of 

3 0 received complex symbol values. The multiplication for the complex symbol value that is not 

rotated may be known and independent of the soft bits. The complex symbol value that is not 
rotated may be a value representing a pilot symbol or a value representing a known symbol such 
as a pseudo pilot symbol. In step 512, the receiver is operated to store each of the generated 
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second set of complex values for a first predetermined time. The first predetermined time in 
some embodiments is selected to be long enough to complete generation of the third set of 
complex values. 

5 Next, in step 514, the receiver is operated to sum said complex values in said 

second set of complex values to generate a complex sum, said complex sum being a complex 
value. Then in step 516, the receiver is operated to store the complex sum. Next, in step 518, a 
third set of complex values is generated by subtracting from said stored complex sum one of 
said stored second set of complex values to generate one of said third set of complex values, said 
10 third set of complex values having the same number of complex values as said first set of 

complex values. Each of the second set of complex values is separately subtracted from the sum. 
The subtractions may be done sequentially or in parallel. 

Next, in step 520, each element of said first set of complex values is multiplied 
15 with the conjugate of a complex value from said third set of complex values to generate a fourth 
set of complex values, said fourth set of complex values having the same number of elements as 
the first and second sets of complex values, said complex values in said fourth set being soft 
symbol values. Operation proceeds from step 520 to step 522. 

2 0 In step 522, a soft input soft output decoding operation is performed on said soft 

symbol values to generate additional soft bits. In some embodiments, the soft input soft output 
decoding operation may be performed by a Low density parity check (LDPC) decoder. In other 
embodiments, the soft input soft output decoding operation is performed by a turbo decoder. 

2 5 Next in step 524, the generated additional soft bits are used to process another set 

of complex symbol values. This may involve a repeat of steps 504 through 522. Operation 
terminates in step 526, e.g., when there are no more complex values from the communication 
channel to be processed. 

3 0 Figure 6 is a block diagram 600 illustrating exemplary sets of information that are 

generated and processed in accordance with the present invention. Block 602 is an exemplary 1 st 
set of complex symbol values. First set of complex symbol values may be from a received set of 
block coherent channel signals, e.g., seven symbols including one known symbol, e.g., a pseudo 
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pilot signal or a pilot symbol, and six symbols conveying user information. Block 604 is an 
exemplary set of soft bits from the decoder, e.g., soft bits corresponding to the received complex 
symbol values of block 602. 

5 Block 606 is an exemplary 2nd set of complex values, which includes at least 

some elements which are the result of a complex multiplication operation between at least some 
of the elements of 1 st set of complex values 602 and complex value(s) determined from at least 
some of the soft bits 604. 

10 An exemplary complex sum 608 is generated from the second set of complex 

values 606. 

Block 610 is an exemplary third set of complex values, the size of the 3 rd set of 
complex values being the same as size of the first or second set of complex values. Each 
15 element in the 3 rd set of complex values 610 is generated from a subtraction operation between 
the complex sum 608 and a member of the 1 st set of complex values 602. Block 612 is an 
exemplary set of values which are conjugates of the the 3 rd set of complex values 610. 

Block 614 is an exemplary 4 th set of complex values, e.g., soft symbol values, 

2 0 which may be generated by a complex multiplication operations between the 1 st set of complex 

values 602 and the conjugates of the third set of complex values 612. Additional soft bits 604' 
may be generated as the result of a soft input soft output decoding operation performed on soft 
symbols 614. The additional soft bits 604' may be used to process another exemplary set of 
complex symbol values 602'. 

25 

In various ones of the above described embodiments, no rotation, e.g., change in 
phase, is caused by the first complex multiplication performed by complex multiplier 303 when 
processing a symbol value correspond to a known fixed value, e.g., a pilot value. This normally 
applies in the case where the known value has no (e.g., zero) imaginary component. In other 

3 0 embodiments, where the known value has an imaginary component, the first complex 

multiplication operation may introduce a change in phase, e.g., a phase rotation, by a fixed 
predetermined amount. 
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Various features of the present invention are implemented using modules. Such 
modules may be implemented using software, hardware or a combination of software and 
hardware. Many of the above described methods or method steps can be implemented using 
machine executable instructions, such as software, included in a machine readable medium such 
5 as a memory device, e.g., RAM, floppy disk, etc. to control a machine, e.g., general purpose 
computer with or without additional hardware, to implement all or portions of the above 
described methods, e.g., in one or more communications network nodes. Accordingly, among 
other things, the present invention is directed to machine-readable medium including machine 
executable instructions for causing a machine, e.g., processor and associated hardware, to 
10 perform one or more of the steps of the above-described method(s). 

Numerous additional variations on the methods and apparatus of the present 
invention described above will be apparent to those skilled in the art in view of the above 
description of the invention. Such variations are to be considered within the scope of the 

o 

15 invention. The methods and apparatus of the present invention may be, and in various 

embodiments are, used with CDMA, orthogonal frequency division multiplexing (OFDM), 
and/or various other types of communications techniques which may be used to provide wireless 
communications links between access nodes and mobile nodes. In some embodiments the 
access nodes are implemented as base stations which establish communications links with 

2 0 mobile nodes using OFDM and/or CDMA. In various embodiments the mobile nodes are 

implemented as notebook computers, personal data assistants (PDAs), or other portable devices 
including receiver/transmitter circuits and logic and/or routines, for implementing the methods 
of the present invention. 



